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Horizontal gene transfer is recognized as an important factor in genome evolution, particularly when the 
newly acquired gene confers a new capability to the recipient species. We identified a gene similar to the 
makes caterpillars floppy (mcfl and mcf2) insect toxin genes in Photorhabdus, bacterial symbionts of 
nematodes, in the genomes of the Epichloe fungi, which are intercellular symbionts of grasses. Infection by 
Epichloe spp. often confers insect resistance to the grass hosts, largely due to the production of fungal 
alkaloids. A mcf-like gene is present in all of the Epichloe genome sequences currently available but in no 
other fungal genomes. This suggests the Epichloe genes were derived from a single lineage- specific HGT 
event. Molecular dating was used to estimate the time of the HGT event at between 7.2 and 58.8 million years 
ago. The mcf-like coding sequence from Epichloe typhina subsp. poae was cloned and expressed in 
Escherichia coli. E. coli cells expressing the Mcf protein were toxic to black cutworms (Agrotis ipsilon), 
whereas E. coli cells containing the vector only were non- toxic. These results suggest that the Epichloe mcf- 
like genes may be a component, in addition to the fungal alkaloids, of the insect resistance observed in 
Epichloe-infected grasses. 



Horizontal gene transfer (HGT) between species is well established as an important factor in genome 
evolution 12 . Most examples of HGT involve transfers between prokaryotic species, but the number of 
examples of HGT into eukaryotes is increasing 3 . Among the eukaryotes, the chitin cell wall of fungi is 
considered a structural barrier, likely reducing the frequency of HGT, but even so there are numerous instances of 
HGT in fungi 3 . Most such instances are from prokaryotes, but there are some documented fungus-to-fungus 
transfers. In a systematic examination of 60 fungal genomes, representing four phyla, a total of 713 genes of 
prokaryotic origin were detected in 53 genomes 4 . The Ascomycota subphylum Pezizomycotina stood out as 
having a large number of HGT events. Richards et al. 3 analyzed the functions of 323 genes confirmed as 
originating from HGT into fungi, and concluded that HGT played a role in expanding the nutrient acquisition 
and environmental colonization capacities of many fungi. 

The amount of transferred DNA ranges from single genes to secondary metabolite gene clusters and even 
whole chromosomes 5 . The age of HGT events into fungi is also quite variable. For example, the transfer of a 
bacterial urea amidolyase gene into fungi was proposed to occur before the divergence of the subphyla 
Pezizomycotina and Saccharomycotina, approximately 400 million years ago (Mya) 6 . In contrast, transfer of 
the virulence gene ToxA from a fungal pathogen of wheat, Stagonospora nodorum, into the fungus Pyrenophora 
tritici-repentis, which resulted in the creation of a new wheat pathogen, was proposed to have occurred less than 
75 years ago 7 . 

Here we report the presence of an insect toxin gene similar to the bacterial makes caterpillars floppy genes (mcfl 
and mcf2) of Photorhabdus luminescens 8 ' 10 in the Epichloe fungal endophytes of grasses. Epichloe are ecologically 
and commercially important intercellular fungal symbionts of many cool season grass species 1112 . The fungi grow 
in between the plant cells in the above ground parts of the plant (Fig. 1). Infection by the Epichloe fungal 
endophytes often confers resistance to abiotic and biotic stresses 13 . The best understood effect of endophyte 
infection on the grass hosts is insect resistance, which is largely attributable to the production of toxic alkaloids by 
the fungi 1415 . Of the hundreds of fungal genome sequences available at NCBI, only the Epichloe have a mcf-like 
gene, suggesting it is the result of a lineage-specific HGT event from bacteria into a predecessor of the modern 
Epichloe. The gene is expressed and is functional in E. typhina subsp. poae Psl, suggesting it maybe a component, 
in addition to the fungal alkaloids, in the observed insect resistance of some endophyte-infected grasses. 
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Figure 1 | Leaf sheath epidermal peel of Poa secunda subsp. juncifolia 
infected with Epichloe typhina subsp. poae Psl stained with Rose Bengal. 

Fungal hyphae growing in between the plant cells are indicated by arrows. 

Results 

Detection of a makes caterpillars floppy-like Gene in Epichloe 
Genomes. Recently we reported a quantitative comparative 
transcriptome analysis of E. festucae-mfected versus endophyte- 
free Festuca rubra L. subsp. rubra (strong creeping red fescue) 
using the high throughput sequencing approach of SOLiD-SAGE 16 . 
Strong creeping red fescue is a commercially important grass species 
and is often naturally infected with the fungal endophyte E. 
festucae 17,18 . One SOLiD-SAGE tag recovered was from an E. 
festucae gene similar to the insect toxin genes makes caterpillars 
floppy 1 and 2 (mcfl and mcf2) of Photorhabdus luminescens 8,9 and 
the fitD toxin genes of some Pseudomonas spp. 19 . Ph. luminescens is a 
symbiotic bacterium of entomopathogenic nematodes. When the 
nematodes infect their insect host, the bacteria are regurgitated 
into the insect hemolymph where they release several toxins that 
kill the insect host 10 ' 20 . The mcfl gene was discovered in a screen of 
Ph. luminescens cosmid clones for insect toxicity and the name comes 
from the effect seen after normally non-toxic Escherichia coli 
expressing the gene were injected into caterpillars 8 , mcfl, also from 
Ph. luminescens, and fitD, from Pseudomonas protegens Pf-5 and 
CHAO isolates (previously called Pseudomonas fluorescens) 21 were 
discovered by their sequence similarity to mcfl and have been 
shown to have a similar effect on caterpillars 9,22 , mcfl has a 
1,623 bp N-terminal extension relative to mcf2 (Fig. 2a). All of 
these genes encode large proteins (2,388 to 2,996 amino acids) 
with a TcdA/TcdB pore-forming domain similar to that of 
Clostridium difficile toxin B 23 . Mcfl was found to kill insects by 
promoting apoptosis via the mitochondrial pathway 20 ' 24 . The Ps. 
protegens FitD toxin contributes to the oral insecticidal activity of 
the bacteria 25 . The mode of action of Mcf2 is unknown 9 . 

The genome sequences of 13 Epichloe spp. are publicly available 
(15; http://www.endophyte.uky.edu/) and all contain sequences sim- 
ilar to the E. festucae mcf-like gene. However, similar sequences are 
not found in any of the hundreds of non-Epichloe fungal genome 
sequences currently available in NCBI. The uniqueness of the mcf- 
like gene to the Epichloe lineage suggests it likely arose from a HGT 
event from a bacterial donor. The alternative scenario in which the 
Epichloe mcf-like genes originated from vertical gene transmission 
can be rejected since this would imply that the mcf-like gene was lost 
in all fungal lineages except the Epichloe. Diagrams of the Epichloe 
mcf-like genes are shown in Fig. 2a. The annotated position of the N- 
terminus of the proteins was variable, but most contained genes with 
long open-reading frames interrupted by 2 introns. The gene struc- 
ture of E. amarillans, E. baconii, E. brachyelytri, E. glyceriae, E. mollis, 



and E. aotearoae is considered the normal structure, based on our 
expression analysis of the gene from E. typhina subsp. poae infecting 
Poa secunda subsp. juncifolia (big blue grass), designated E. typhina 
subsp. poae Psl 26 ' 27 (discussed below). The E. elymi, E. festucae RC 
(the Rose City isolate of E. festucae), E. typhina subsp. poae 5819, and 
E. gansuensis isolates had mutations that resulted in frame shifts and 
early termination codons and so would not be expected to produce 
an intact Mcf-like protein. The E. gansuensis var. inebrians and E. 
typhina E8 isolates were annotated as having N-terminal extensions 
relative to most of the other species. In both cases the annotated 
extension can be attributed to a deletion of the presumed normal 
coding sequence start site and recognition of upstream open reading 
frames as exons. The E. festucae 2368 and Fll isolates had deletions of 
the 5' region of the gene and were annotated as beginning at a 
downstream region of the gene. The remarkable amino acid similar- 
ity of the predicted Mcf-like protein from E. typhina subsp. poae Psl 
to the bacterial Mcfl and Mcf2 proteins is illustrated in Fig. 2b. The 
similarity extends throughout the 1,997 amino acid long protein and 
is particularly striking in the TcdA/TcdB pore forming domain. 

The Epichloe mcf-like coding sequences were more similar in size 
to those of Ph. luminescens mcf2, but the overall number of amino 
acid identities in the region of overlap was similar between the 
Epichloe sequences and mcfl, mcf2, and fitD (Supplementary Table 
1). The nearly identical similarity of the Epichloe sequences to all 
three bacterial genes, as well as a preliminary phylogenetic analysis 
precluded identification of the likely donor in the HGT event (dis- 
cussed more below). 

As is typical of bacterial genes, there are no introns in the mcfl, 
mcf2, or fitD genes, but there are presumed introns in the Epichloe 
mcf-like genes. Since it appears the Epichloe mcf-like genes originated 
from a HGT event, the introns must have been acquired subsequent 
to the HGT. Such intron acquisition has been found in other eukar- 
yotic genes of bacterial origin 4,28 . Identical intron positions in genes 
from different recipient species suggest a single HGT event in the 
oldest member of the lineage, with subsequent vertical inheritance in 
the more recently diverged species 28 . All of the Epichloe mcf-like 
genes had an intron in the 3' region of the gene at an identical 
insertion site, although there is sequence variation within the intron 
sequence itself. Most also had a conserved 5' intron, except the E. 
gansuensis var. inebrians and E. typhina E8 isolates, both of which 
had deletions in the 5' sequence, which resulted in annotated N- 
terminal extensions. The E. festucae 2368 and Fll isolates, which 
had truncated 5' regions, also lacked the 5' intron. The two identical 
intron positions suggest there was a single HGT insertion as well as 
single intron acquisition events in the original horizontally trans- 
ferred mcf-like gene, which was subsequently vertically inherited 
throughout the speciation of the Epichloe. 

Estimation of the Age of the mcf-like Gene HGT Event into 
Epichloe. There have been numerous phylogenetic studies of 
Epichloe, but none have directly addressed the order and age of 
species divergence. To aid in estimation of the age of the mcf-like 
gene HGT event into Epichloe, we undertook such an analysis. Most 
phylogenetic studies of Epichloe have been based on P-tubulin 
(tubB), elongation factor EF-la (tefA), and y-actin DNA sequence 
comparisons. In a comprehensive survey that identified optimal 
fungal genes for use in phylogenetic analyses, these commonly 
used genes were not among the best 29 . The two most useful genes 
were MCM7, a subunit of the minichromosome maintenance 
complex 30 , and TSR1, a gene required for synthesis of 40S 
ribosomal subunits 31 . MCM7 and TSR1 have been found useful in 
fungal phylogenies that include a wide range of taxa 32 " 34 . Other high- 
performing genes identified were NAD -dependent glutamate 
dehydrogenase and isoleucine tRNA synthase 29 . 

Maximum parsimony phylogenetic analyses of MCM7, TSR1, 
NAD -dependent glutamate dehydrogenase, and isoleucine tRNA 
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Figure 2 | Gene structure of the Epichloe mcf-like genes and amino acid similarity of E. typhina subsp. poae Psl-Mcf protein with the bacterial Mcf 
proteins, (a) Diagrams of gene structure of the bacterial mcfl, mcf2, fitD, and Epichloe mcf-like genes. The exons are indicated by boxes and the introns by 
lines. The conserved TcdA/TcdB pore-forming region is indicated by filled in boxes. The positions of the premature stop codons in some of the Epichloe 
genes are indicated by *. The end of intron 1 of the E. typhina subsp. poae 5819 sequence was modified from the annotated version to that of the 
experimentally determined position in E. typhina subsp. poae Psl. (b) GECA analysis illustrating the amino acid sequence similarity of the E. typhina 
subsp. poae Psl -Mcf protein (1,997 amino acids; accession KJ502561) to Mcfl (2,929 amino acids; accession AF503504.2) and Mcf2 (2,388 amino acids; 
accession AY445665) proteins from Ph. luminescens. 



synthase sequences from 13 Epichloe, as well as some related 
Clavicipitaceae, are shown in Fig. 3 and Supplementary Fig. 1. 
Accession numbers of the sequences are given in Supplementary 
Table 2. To facilitate the alignments for these analyses the intron 
sequences were removed and only the DNA coding sequences were 
used. The Metarhizium anisopliae sequences were designated as the 
outgroups for rooting the trees since M. anisopliae is in a sister clade, 
which diverged prior to that of Epichloe 35 . In all of the trees, the two E. 
gansuensis sequences were in a sister clade to the rest of the Epichloe 
spp., which had E. glyceriae at its base. Since both E. gansuensis and E. 
glyceriae have the mcf-like gene in their genomes, the HGT event 
must therefore have occurred in the common ancestor of these two 
basal Epichloe spp. 

To estimate the age of the HGT event we used a molecular clock 
approach 36 similar to that used to estimate the age of polyploidy in 
Gossypium (cotton) 37 , the divergence times of maize LTR-retrotran- 
sposons 38 , and the age of a segmental duplication in maize 39 . First we 
estimated the divergence rate for each gene by calibration to 
Atkinsonella hypoxylon using the formula T = KJ2x. A. hypoxylon 
(synonym: Balansia hypoxylon) 40 is closely related to B. epichloe 41 . B. 
epichloe was placed in a sister clade to the clade containing E. typhina 
and the Claviceps spp. and the divergence of the two clades was 
estimated to have occurred 81 Mya 35 . The K s (substitutions at syn- 
onymous sites) and K a (substitutions at nonsynonymous sites) for 



the coding sequence of each gene relative to the A. hypoxylon ortho- 
log was determined (Supplementary Table 3). In this analysis glu- 
tamate dehydrogenase from E. gansuensis var inebrians was not 
included, since the gene sequence was split between two contigs, 
which introduced a gap into the alignment. The low K a /K s ratios 
for these four genes indicate they are under purifying selection. 
From the K s for each gene, the rate of divergence of synonymous 
sites (r) for each gene was calculated (Supplementary Table 3). The 
calculated rates varied from 5.93 X 10" 9 to 7.72 X 10" 9 substitutions 
per synonymous site per year, which are similar to reported fungal 
gene divergence rates 42 . 

The calculated rates of divergence of each gene were then used to 
calculate the time of divergence (T) between the Claviceps clade and 
the Epichloe clade as well as between E. glyceriae and E. gansuensis 
sequences (Supplementary Tables 4 and 5). The mean calculated 
divergence time between the Claviceps clade and the Epichloe clade 
was 58.8 Mya and the mean divergence time between E. gansuensis 
and E. glyceriae was 7.2 Mya. These estimates indicate that the single 
introduction of the mcf-like gene into Epichloe was sometime after 
the divergence of the Claviceps and Epichloe clades at 58.8 Mya and 
before the divergence of E. gansuensis and E. glyceriae at 7.2 Mya. 

The Epichloe mcf-like Gene Donor is Likely a Bacterium Related to 
Xenorhabdus or Photorhabdus. Phylogenetic analysis of genes 



SCIENTIFIC REPORTS | 4:5562 | DOI: 1 0.1 038/srep05562 



3 



MCM7 



100 



94 



{£. festucae 2368 
E. festucae FI1 
E. mollis 
E. baconii 
.E. amarillans 
E. brachyelytri 
E. typhina E8 

E. typhina subsp. poae 5819 
E. aoteroae 
E. elymi 
I— E. glyceriae 
-jqq -E. gansuensis 

LE. gansuensis var. inebrians 

100i C. paspa//' 

78 p i A ta/ce 



79 



-C. fusiformis 
— C. purpurea 
.A. hy poxy Ion 



M. anisopliae 
— 50 changes 



Figure 3 | Rooted 50% majority rule maximum parsimony phylogenetic 
tree of the MCM7 DNA coding sequences. The Metarhizium anisopliae 
sequences were designated as the outgroup for rooting the tree. The 
numbers at the nodes are the bootstrap percentages based on 1,000 
replications. The tree was based upon 2,472 total characters, of which 1,549 
were constant, 343 variable characters were parsimony uninformative, and 
580 variable characters were parsimony informative. The unmarked nodes 
all had bootstrap support of 69 or higher. 

derived from a HGT event can often be used to identify the donor 43,44 . 
The best matches to the Epichloe Mcf-like amino acid sequences were 
Mcfl and Mcf2 from Photorhabdus luminescens, and FitD from 
Pseudomonas spp. As discussed above, the overall similarity of the 
Epichloe Mcf-like sequences to the bacterial proteins was nearly the 
same (Supplementary Table 1). Similar Mcf2 proteins were also 
found in Xenorhabdus spp., also bacterial symbionts of 
entomopathogenic nematodes 10 , but have not been functionally 
characterized. Maximum parsimony phylogenetic analysis of the 
bacterial Mcf proteins and Epichloe mcf-like proteins is shown in 
Fig. 4. The Vibrio tubiashii sequence was chosen to root the the 
tree since its similarity to the Epichloe sequences is considerably 
lower than the Xenorhabdus, Photorhabdus, and Pseudomonas 
sequences. Similar trees were obtained when using the the 
neighbor joining and branch and bound methods (Supplementary 
Fig. 2). The sequence alignment used to generate the tree is shown in 
Supplementary Fig. 3. Accession numbers of the sequences used are 
given in Supplementary Table 2. 

In the phylogenetic tree the Mcf-like sequences from E. gansuensis 
var. inebrians and E. typhina subsp. poae were placed as basal to the 
Xenorhabdus spp. Mcf2 and Photorhabdus spp. Mcf proteins, mak- 
ing it impossible to hypothesize as to a specific bacterial donor. Such 
a phylogenetic placement suggests the bacterial donor may be an 
unknown or extinct bacterial species related to extant Xenorhabdus 
or Photorhabdus spp. In a phylogenetic comparison of many 
Pseudomonas spp., only one clade had fitD genes, and the authors 
suggested they were acquired through horizontal gene transfer 19 . The 
phylogenetic placement of the Pseudomonas FitD sequences as well 
as the overall length similarity to Mcfl of Photorhabdus spp., are 
consistent with a Photorhabdus sp. as the donor to Pseudomonas. 

Expression and Activity of the Epichloe mcf-like Gene. The 

recovery of a SOLiD-SAGE tag from Festuca rubra infected with E. 
festucae 16 supports the expression of the E. festucae mcf-like gene. To 
confirm expression, we amplified and sequenced the mcf-like cDNA 
from Festuca rubra infected with the Rose City isolate of E. festucae, 
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Figure 4 | Rooted 50% majority rule maximum parsimony phylogenetic 
tree of bacterial Mcf proteins and the Mcf-like protein sequences from E. 
gansuensis var. inebrians and E. typhina subsp. poae. The Vibrio tubiashii 
sequence was designated as the outgroup for rooting the tree. The numbers 
at the nodes are the bootstrap percentages based on 1,000 replications. The 
tree was based upon 1,983 total characters, of which 391 were constant, 404 
variable characters were parsimony uninformative, and 1,188 variable 
characters were parsimony informative. 

designated E. festucae RC. The expression level was extremely low 
and required three rounds of PCR to obtain enough sample for 
sequencing. Sequence analysis of the E. festucae RC mcf-like cDNA 
revealed a single base insertion resulting in a frame-shift and an early 
termination codon, relative to the E. festucae 2368 and Fll isolates. 
The single base insertion was confirmed by sequencing that region of 
the E. festucae RC genome. Although the mcf-like gene is expressed in 
the E. festucae RC isolate, it could not produce a full-length protein. 

We also analyzed expression of the mcf-like gene in E. typhina 
subsp. poae Psl, the fungal endophyte infecting Poa secunda subsp. 
juncifolia 26 . The E. typhina subsp. poae Psl -mcf cDNA generated 
from infected plant leaf sheath tissue was easily amplified with one 
round of PCR (Fig. 5a). Sequence analysis of the amplified cDNA 
revealed it could produce a full-length protein. The cDNA sequence 
also confirmed the positions of the two annotated introns. The intron 
sequences were determined by sequencing of genomic DNA. 

Since the mcf-like gene in E. typhina subsp. poae Psl (henceforth 
designated as E. typhina subsp. poae Psl-mc/) was expressed and was 
predicted to produce a full-length protein, its activity against insects 
was assessed. The toxicity of the bacterial Mcf proteins was deter- 
mined by injection of E. coli cells expressing the protein into insect 
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Table 1 | Cumulative numbers of black cutworm larvae deaths after 
each treatment. Insect assays were performed by injecting either 
sterile water, 5 X 10 7 total induced E. coli cells containing the 
pCold II vector only as a negative control or the Etp-Mcf::pCold 
II plasmid in a total volume of 5 ul. The experiments also included 
larvae that were untreated 
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hemocoel 8 ' 9,22 . The E. typhina subsp. poae Psl -mcf coding sequence 
was expressed in E. coli by cloning into the expression vector pCold 
II. This is the same strategy that was used to determine the activity of 
the Ps. protegens FitD protein 22 . A band of the expected size was seen 
on SDS-PAGE analysis of proteins from induced E. coli cultures 
(Fig. 5b). The induced protein was found in the insoluble fraction 
as was reported for the Ps. protegens FitD protein 22 . The toxicity of 
the induced protein was assayed by injecting black cutworms 
(Agrotis ipsilon) with 50 million E. coli cells harboring either the 
pCold II vector only or cells expressing the E. typhina subsp. poae 
Psl -Mcf protein. The E. coli cells containing the vector only were not 
toxic to the cutworms, whereas injection of the cells expressing the 
Mcf protein resulted in statistically significant levels of cutworm 
deaths when compared to the results for the vector only controls 
(Table 1; Fig. 6). Lower levels of E. coli cells (20 million) expressing 
the E. typhina subsp. poae Psl -Mcf protein were not toxic to the 
cutworms. 

Discussion 

Here we have reported the characterization of a mcf-like gene in the 
Epichloe fungal endophytes of grasses that is the result of a HGT 
event from a bacterial donor. The gene is present in the genomes 
of all 13 of the Epichloe spp. for which whole genome sequence data is 
available, as well as 2 additional isolates used in this study, but in no 
other fungal species. In 11 of the 15 Epichloe sequences a long pro- 
tein, containing the TcdA/TcdB pore-forming region, is predicted 
and in 4 of the Epichloe sequences there was a mutation leading to an 
early termination codon. In 3 out of these 4 cases, multiple isolates of 
the same species provided examples of early termination of the mcf- 
like gene and also of a gene capable of producing a potentially func- 
tional protein. A similar isolate specific breakdown of some alkaloid 
biosynthetic genes was seen among Epichloe spp. 15 . Phylogenetic 
analysis suggests the donor bacterium may have been an as yet 
unknown predecessor of extant Xenorhabdus or Photorabdus spp. 
The mechanisms underlying HGT events are not known, but clearly 
the donor and recipient species must be in close contact. It is easy to 
imagine that the recipient Epichloe sp. could have come in contact 
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Figure 5 | Expression of E. typhina subsp. poae Psl -mcf in vivo and in E. 
coli. (a) PCR product of the E. typhina subsp. poae Psl -mcf transcript using 
infected Poa secunda subsp. juncifolia plant cDNA generated from 
oligo(dT) as template, (b) SDS-PAGE analysis of insoluble E. coli proteins. 
Lane 1: E. coli containing the pCold II vector only subjected to induction 
conditions. Lane 2: E. coli containing the pCold 11:: E. typhina subsp. poae 
Psl -Mcf with no induction of protein expression. Lane 3: E. coli containing 
the pCold II::E. typhina subsp. poae Psl -Mcf subjected to induction 
conditions. Arrow indicates presence of the induced protein at the 
expected size of 223 kD. 



with the bacterial donor either in soil or in association with a plant 
host. 

HGT events are of particular significance when the newly acquired 
gene confers a new capability to the recipient species. A recent 
example is the transfer of a bacterial mannanase gene to the insect 
Hypothenemus hampei, the coffee berry borer beetle 43 . The acquisi- 
tion of the mannanase gene was considered to have led to the adapta- 
tion of the insect to a new host, the coffee berry, which is rich in 
mannans. Here we have demonstrated that injection of E. coli expres- 
sing the E. typhina subsp. poae Psl -mcf gene into black cutworms 
(Agrotis ipsilon), a pest of various crops, results in death of the insect, 
similar to the insecticidal activities of the bacterial mcfl, mcf2, and 
fitD genes 8,9 ' 22 . The bacterial Mcf proteins are considered critical 
components of the overall insect toxicity of the bacteria. The E. 
typhina subsp. poae Psl -Mcf protein has been maintained as an 
active protein over millions of years since its acquisition by an ances- 
tral Epichloe sp. This suggests it may also be a component, in addition 
to alkaloids that are well-established as having insecticidal activ- 
ity 15 ' 45 , in the insect resistance observed in plants containing this 
endophyte isolate 46 . Other Epichloe spp. have genes apparently cap- 
able of producing Mcf-like proteins that may also have activity 
against insects, as shown here for the E. typhina subsp. poae Psl 
isolate. 

Bacterial toxin genes of the aerolysin family (unrelated to the mcf 
toxin genes) were found to have undergone recurrent HGT to many 
diverse eukaryotes, possibly due to their adaptive value 47 . Moran 
et al. 47 proposed that genes derived from HGT are more likely to 
be maintained if they can function alone and are of immediate benefit 




Untreated Water Vector only Etp-Mcf 



Figure 6 | Toxicity of the E. typhina subsp. poae Psl -Mcf protein to black 
cutworms. Fourth instar cutworm larvae were injected with water, 50 
million E. coli cells containing the pCold II vector only, or 50 million E. coli 
cells expressing the E. typhina subsp. poae Psl -Mcf protein. The live 
cutworms curl in response to prodding. 
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to the organism. These characteristics apply to the Epichloe mcf 
genes, also derived from HGT, since the genes may contribute to 
the insect toxicity of Epichloe -infected plants. 

Based on phylogenetic analyses and divergence time estimates, the 
single HGT event into Epichloe was estimated to have occurred 
sometime between 7.2 and 58.8 Mya. Dating of fungal evolution is 
problematic due to the minimal fossil record, so as more fungal 
calibration points become available it will be possible to refine the 
divergence time estimates presented here. The rooted phylogenetic 
trees generated in these analyses, based on genes established as phy- 
logenetically high-performing genes 29 , are also revealing regarding 
the evolutionary history of Epichloe. In all four phylogenetic trees the 
Epichloe spp. separated into two clades, one comprised of the two E. 
gansuensis isolates and the other comprised of the rest of the Epichloe 
spp., with E. glyceriae at its base. The phylogenetic analyses presented 
here suggest that an as yet unknown (or possibly extinct) species was 
the predecessor of E. gansuensis and E. glyceriae and that the rest of 
the currently known Epichloe spp. were derived from E. glyceriae. 

It has been proposed that Epichloe spp. evolved through revolu- 
tion with their grass hosts 48 . The two most basal members of this 
genus, E. gansuensis (host grass Achnatherum inebrians, tribe 
Stipeae) and E. glyceriae (host grass Glyceria striata, tribe Meliceae) 
are symbionts of ancient tribes of the grass subfamily Pooideae. 
However, E. brachyelytri, the symbiont of Brachyelytrum erectum, 
which is in the most ancient Pooideae tribe Brachyelytreae 49,50 , was 
phylogenetically closer to the other Epichloe spp. than to E. gansuensis 
or E. glyceriae. These results indicate that the speciation of Epichloe 
was not exclusively through coevolution with their hosts since, in an 
exclusive model of coevolution, E. brachyelytri would be expected to 
be the most basal Epichloe sp. The phylogenetic trees presented here 
suggest that E. brachyelytri evolved via cross species transfer of an 
Epichloe sp. infecting a more recently evolved host grass to the 
ancient plant species B. erectum. Jackson 51 has also concluded that 
the Epichloe-host phylogenies are incongruent and do not support 
codivergence as a characteristic of the association. Lemaire et al. 52 , 
proposed host specificity without co- speciation for the Burkholderia 
bacterial leaf endosymbionts. The Epichloe - grass symbiosis may be a 
similar case. 

Methods 

Plant and Fungal Samples. Epichloe spp. -infected plants were grown by clonal 
propagation and maintained in 15 cm pots in the greenhouse. Each Epichloe sp. was 
isolated from its host plant by plating surface- sterilized leaf sheath tissue on potato 
dextrose agar (Difco Laboratories, Detroit, MI). 

DNA and RNA Isolation. Genomic DNA of Epichloe spp. was extracted from culture 
grown in potato dextrose broth for 14 days on a shaker (175 rpm) at room 
temperature (23-25°C). The DNA was isolated as previously described 53 . RNA was 
obtained from the innermost leaf sheath tissues of Epichloe-infected plants. RNA 
isolation was as previously described 16 . Nucleic acid concentration was measured 
using a Nanodrop ND- 1000 Spectrophotometer (Thermo Fisher Scientific, Waltham, 
MA). 

Amplification and Sequencing of Epichloe mcf-like Genes and cDNAs. Initial 
screening to determine the presence of the mcf-like gene in E. festucae RC and E. 
typhina subsp. poae Psl fungal isolates was done using fungal genomic DNA. Upon 
detection of the gene, first-strand cDNA from 5 jag Epichloe-infected plant total RNA 
was synthesized by using Superscript™ III Reverse Transcriptase (Life Technologies, 
Carlsbad, CA) and 500 ng of oligo(dT) 18 primer according to the manufacturer's 
instructions. PCR was performed in 50 uL with either 0.5 ug of fungal genomic DNA 
or 1 uL Epichloe-infected plant cDNA generated from oligo(dT) 18 , 1.25 units of 
PrimeSTAR HS DNA Polymerase, IX PrimeSTAR Buffer with Mg 2+ (TaKaRa Bio 
Inc., Shiga, Japan), 200 uM each dNTP, and 0.3 uM of each forward and reverse 
primer (Integrated DNA Technologies, Inc., Coralville, IA). Two-step PCR was 
performed by template denaturation at 98 °C for 10 seconds followed by 6 minutes 
extension at 68°C for 30 cycles in a GeneAmp 9700 thermal cycler (Applied 
Biosystems, Inc., Foster City, CA). The amplification products were visualized on a 
1% TBE agarose gel. The amplified genomic and cDNA PCR fragments were purified 
by using 0.5X Agencourt AMPure XP (Beckman Coulter, Inc., Brea, CA) to remove 
any fragments under 1 kb and then sequenced directly (Genewiz, Inc., South 
Plainfield, NJ). For each sequencing reaction, about 800 ng of purified PCR product 
in 10 uL was treated with 2 uL ExoSAP-IT (USB Corp., Cleveland, OH) to remove 



unincorporated primers and excess dNTPs. The ExoSAP-IT reaction was performed 
at 37°C for 15 min followed by heating at 80°C for 15 min to inactivate the enzymes. 

Expression of the E. typhina subsp. poae Psl-mc/Gene in E. coli. To test the anti- 
insect activity of the E. typhina subsp. poae Psl -Mcf protein, the cDNA was cloned 
into the expression vector pCold II (TaKaRa Bio). The E. typhina subsp. poae Psl 
cDNA was amplified by PCR as described above with oligonucleotides that 
introduced a Kpnl site at the 5' end and an Eco RI site at the 3' end. The sequences of 
the oligos are: forward 5 ' - GATATAACCATGGCTCACAACACT- 3 ' and reverse 

5 ' -TCAACTGAATTCCTACTGATTTCCAGC-3 ' . Two hundred ng of the puri- 
fied PCR product was digested with the restriction enzyme Kpnl (TaKaRa Bio), 
purified with 0.35X Agencourt AMPure XP (Beckman Coulter, Inc) to remove any 
fragments under 1.5 kb, digested with Eco RI, and again purified with 0.35X AMPure 
XP. The expression vector pCold II was similarly digested with Kpnl and EcoRl. The 
digested pCold II plasmid was treated with shrimp alkaline phosphatase (Affymetrix, 
Santa Clara, CA) to prevent vector religation. Overnight ligation of the digested E. 
typhina subsp. poae Psl -mcf PCR fragment and pCold II plasmid was done at a 3 : 1 
insertvector molar ratio using T4 DNA ligase (New England Biolabs, Inc., Ipswich, 
MA). 

Two uL of the ligation product was used to transform 20 ul E. coli DH5a elec- 
troporation-competent cells. The transformed cells were incubated in SOC medium 
for 1 hour at 37°C, followed by overnight growth of cells on LB medium supple- 
mented with 100 ug/mL ampicillin. Transformed bacterial colonies were screened for 
plasmids containing the E. typhina subsp. poae Psl-mcf gene insert by using PCR as 
described above. Selected clones were grown in 3 mL of LB + ampicillin broth 
overnight followed by plasmid purification using QIAprep Spin Miniprep Kit 
(Qiagen, Valencia, CA). DNA sequencing of the plasmids were done as described 
above. A plasmid containing the correct sequence was transformed into E. coli BL2 1 - 
CodonPlus (DE)-RIPL competent cells (Agilent Technologies, Santa Clara, CA). For 
use as a control, a transformant containing the pCold II vector only was also 
generated. 

For expression of the E. typhina subsp. poae Psl -Mcf protein, the BL2 1 -CodonPlus 
(DE3)-RIPL transformant was induced by cold shock and isopropyl-(3-D-thiogalac- 
topyranoside (IPTG). We detected loss of E. typhina subsp. poae Psl-mcf::pCold II 
recombinant plasmid in E. coli cells during the course of long induction periods. This 
was not observed in E. coli cells containing the pCold II vector only. Previous studies 
have shown that carbenicillin and ampicillin, even in high concentrations, quickly 
lose their ability to maintain selective pressure due to saturation of the media with the 
antibiotic degrading enzyme P-lactamase 54 ' 55 . Extra metabolic burden placed on the 
bacterial cells to maintain the large recombinant plasmid combined with the steady 
decrease of antibiotic in culture may have contributed to the plasmid loss that we 
observed. In order to circumvent this problem, the culture supernatant was replaced 
with fresh medium at regular intervals. 

To induce expression a 10 mL culture of the cells was grown for 15 h at 30°C in LB 
and 200 ug/mL carbenicillin. A 400 uL aliquot of the culture was pelleted, resus- 
pended in 400 uL, added to 10 mL of fresh LB plus carbenicillin and incubated at 
30°C to an OD of 0.4-0.5. The culture was then cold-shocked for 15 minutes on ice 
and subsequently pelleted in a Model J2-21 centrifuge (Beckman Coulter, Inc., Brea, 
CA) set to 5°C to obtain the bacterial cells. Fresh LB broth supplemented with 
carbenicillin was used to resuspend the bacterial pellets. To induce protein expression 
in the culture, IPTG was added to 0. 1 mM and the culture was then incubated at 1 5°C 
with rotational shaking for 2 1 -24 h. The bacterial cells were pelleted and resuspended 
in fresh LB and carbenicillin every 3 h for the initial 6 h and again after 12 h to 
prevent ^-lactamase saturation in the culture supernatant. The transformant con- 
taining pCold II vector only was treated the same way. 

For sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) 
analysis, 1 ml each of E. coli cells expressing the E. typhina subsp. poae Psl -Mcf 
protein and the vector only control cultures were pelleted and lysed using IX 
FastBreak Cell Lysis Reagent (Promega, Madison, WI) following the manufacturer's 
protocol. Following lysis, samples were centrifuged at 10,000 RPM for 15 minutes and 
the supernatants removed. The proteins remaining in the pellet were solubilized by 
adding 100 uL of 2X SDS sample buffer 56 , boiled for 5 minutes, and 30 uL was then 
subjected to electrophoresis on a 10% gel. 

Insect Assays to Determine Toxicity of E. typhina subsp. poae Psl -Mcf Protein. 

BL21 -CodonPlus (DE3)-RIPL cells expressing the E. typhina subsp. poae Psl- 
Mcf::pCold II recombinant insert and vector only control were prepared as described 
above. Cells were pelleted and resuspended in sterile water. Black cutworm (Agrotis 
ipsilon Hufnagel), eggs were obtained from Benzon Research Inc. (Carlisle, PA) and 
reared on insect diet (Southland Products Inc. Lake Village, AR) at 27°C and 14 : 10 
(L : D) hours to fourth instars. Immediately before injection the larvae were surface 
sterilized in 0.8% sodium hypochlorite for 30 seconds and then rinsed in sterile 
distilled water. Six to ten fourth instars were injected with 5 X 10 7 E. coli cells 
containing either the E. typhina subsp. poae Psl-Mcf::pCold II induced protein or the 
pCold II vector only control. Additional controls were water injections and untreated 
samples. The assay was conducted twice. Injections were in a total volume of 5 uL 
using a Hamilton microsyringe with a 30-gauge needle. The larvae were then placed 
individually in 30 mL plastic cups with 10 g of moist (10% w/v) pasteurized (3 h at 
72°C) sand with about 1 cm 2 diet and incubated at room temperature (22-24°C). 
Mortality was assessed at 24, 48, 72, and 120 h after injection. Larvae were considered 
dead when they no longer reacted to being poked repeatedly with a probe. The data at 
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the final time points were analyzed by Fisher's exact test to compare the toxin- 
expressing samples to the vector only controls. Significance was evaluated at P < 0.05. 

Schematic Representation of mcf Gene Structure. Schematic representation of the 
mcf gene structures was done using Gene Structure Display Server (57; http://gsds.cbi. 
pku.edu.cn/index.php ?input= site). Illustrative comparison of amino acid similarity 
for Mcfl, Mcf2 and E. typhina subsp. poae Psl-Mcf protein was performed using 
Gene Evolution and Conservation Analysis (GECA) (58; http://peroxibase.toulouse. 
inra.fr/ tools/ geca_input_demo .php ) . 

Phylogenetic Analysis. To identify genes in addition to MCM7 and TSR1 for use in 
the phylogenetic analyses, the FUNYBASE genome database (http://genome.jouy. 
inra.fr/funybase/) 59 was searched for single copy orthologs identified as 
phylogenetically high-performing genes by Aguileta et al. 29 . Two genes with 
topological scores of 96 or above and that also had significant sequence 
polymorphism within Epichloe were identified, FG570 (NAD-dependent glutamate 
dehydrogenase) and MS444 (isoleucine tRNA synthase). DNA sequences for these 
genes were obtained from the Genome Project at the University of Kentucky website 
(http://www.endophyte.uky.edu/) or from NCBI. 

The Clustal-X program 60 was used to align the DNA coding sequences. The 
alignments generated by Clustal-X were modified manually to minimize gaps. The 
phylogenetic analyses were performed with the PAUP* program, version 4.0b 10 for 
Macintosh. The phylogenetic analyses were done by using the maximum parsimony 
full heuristic search option set to random sequence addition, tree-bisection-recon- 
nection (TBR) branch swapping, and Multrees on, with 1,000 bootstrap replications. 
Gaps were treated as missing data. 

The K s and K a values based on pairwise comparison between species were deter- 
mined by using the MEGA5.2 program with the Nei-Gojobori, Jukes-Cantor model 61 . 
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